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Abstract 

We prove the sum of squared logarithms inequality (SSLI) which states that for nonnegative 
vectors x,y G M” whose elementary symmetric polynomials satisfy ek{x) < ek{y) (for 1 < 
k < n) and en{x) = en{y), the inequality ^ holds. Our proof of this 

inequality follows by a suitable extension to the complex plane. In particular, we show that 
the function / : M C C"' —>• M with f{z) = has nonnegative partial derivatives 

with respect to the elementary symmetric polynomials of z. This property leads to our proof. 
We conclude by providing applications and wider connections of the SSLI. 
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1 Introduction 


The sum of squared logarithms inequality (SSLI) arose first as a scientific issue in 2012 
proving the following optimality result 


231 while 


inf symLogQ^F p = inf inf symF P = log VF^F\\^, (1) 

Qeso(n) QeSO(n) YeK"X" 

exp(y) = Q^F 


where Y = LogX denotes all solutions of the matrix exponential equation exp(y) = X, || • || 
denotes the Frobenius matrix norm, and symX := ^{X + X^). 

The SSLI (formally stated in Theorem 11.21) has been investigated in a series of works. In 2013, it 
was examined more closely by Birsan, Neff and Lankeit in [7|, who found a proof for n G {2,3}. 
For n = 3, the inequality can be written as follows: let Xi,X 2 ,X 3 ,yi,y 2 ,y 3 > 0 be positive real 
numbers such that 


X 1 + X 2 + X 3 < yi + y 2 + y3, 

Xi X 2 + 0 : 10:3 + X 2 X 3 < 2/1 1/2 + 2/12/2 + 2/2 2/3, 

Xi 0 : 2 X 3 = 2/12/2 2/3- 

Then, the sum of their squared logarithms satisfy the following inequality: 

(logXi)^ + (logX2)^ + (logX2)^ < (log 2/1)^ + (log 2/2)^ + (log 2/3)^ 

In 2015, Pompe and Neff proved the SSLI for n = 4, based on a new idea that did not extend 
to higher dimensions without further complications. To state the SSLI for arbitrary n, we hrst 
recall 

Definition 1.1. Let x G M"". We denote by efc(x) the fc-th elementary symmetric polynomial, i.e. 
the sum of all (^) products of exactly k components of x: 

(^k{x) := ^ XjjXij ...Xi^ for any A: G {1,..., n} . 

Note that ei(x) = xi + X 2 + • • • + x„ and e„(x) = xi • X 2 • ... • x„. 

We also write R+ := (x G M | x > 0} and M_ := (x G M | x < 0} and set = (M+)”. 

Theorem 1.2 (Sum of squared logarithms inequality). Let n G N and x, 2 / G M” such that 

^k{y) for all A: G (1,..., n — 1} 

Sniy)■ 

n 

Y^i^ogyif. 

i=\ 

minimization problem: 


and 


efc(x) < 
en(x) = 


Then 


5^(logXi)2 < 


i=l 


This statement can equivalently he expressed as a 
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For X G M”, let 


£x ■= { 2 /e I efc(x) < efc(j/) for all k e {I,... ,n - 1] and e„(a;) = e„(y)} . 


Then 


inf 

y&Sx 


2 = 1 


(log Vi 


(log Xif . 

2=1 


Since (log^j)^ > 0 for all i G {1,.. ■,n}, the expression is bounded below by 0, so the inhmum 
clearly exists. Note that Sx is a non-convex set. 

Remark 1.3. If the equality assumption e„(x) = en{y) in the last elementary symmetric poly¬ 
nomial is replaced by the weaker requirement en{x) < en{y), then the conclusion no longer holds 
in general. As a counterexample, consider x = (e“^,..., e~^) G M” and y = (!,...,!) G M"; then 
efc(x) = < (") = ekiy) for all fc G {1,..., n}, but Er=i(log^*)^ = n > 0 = Y.i=i(^ogyif. 

Neff, Nakatsukasa and Fischle showed that the SSLI implies ([T]). 

The proof of the SSLI presented in this work was motivated by the second named author, who 
published the SSLI conjecture (at that point) on the internet platform MathOverflow j^. The 
hrst named author extended the problem to the complex plane and presented a sketch of a proof. 

Miroslav Silhavy (Czech Academy of Science) considered the problem after private communication 
with P. Neff and provided a characterization of functions that satisfy E-monotonicity. Interestingly, 
shortly after seeing L. Borisov’s solution, one of the authors (S. Sra) suggested via email that “a 
full generalization of this idea should be possible via Pick-Nevalinna theory.” This idea is natural, 
and the details were independently discovered and worked out by M. Silhavy j^; it is also worth 
noting that actually Jozsa and Mitchison [l^ foreshadowed the Pick function based approach to 
proving such inequalities but did not develop it fully. Our remarks here merely outline the historical 
sequence of events (to our knowledge), and to highlight the remarkable fact that like many other 
problems in mathematics, the SSLI also witnessed several essentially simultaneous solutions; each 
exposing different aspects of it and thus contributing to our understanding. 

In this paper we give a self-contained exposition of our new methods towards proving the SSLI. 
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Proof of Theorem 


1.2 


In our further calculations, we will use the following lemma and the resulting corollary. 

Lemma 2.1. Let zi,..., Zn be pairwise different complex numbers and k G {0, ..., n — 1}. Then 


E 




^j) 


t^ 

- z,) 


for all t E C \ {zi ,..., Zn} ■ 


( 2 ) 


For n = 3 and A; = 2, for example, the equality reads 

_I_ ^ _I_^ 

{a—h)(a—c){t—a) ' {h—a){h—c){t—h) ' {c—a){c—h){t—c) {t—a)(t—h)(t—c) 
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Proof. Let k G {0,..., n — 1}. Then according to the theorem of partial fraction decomposition, 
there exist complex numbers oi,..., such that 






for alH G C \ {zi,..., Zn} ■ 


2=1 


For given r G {1,..., n}, multiplying both sides of the equation with t — Zr yields 


E 


t — Zr 


- Zj) ^t-Zi 


tti for alH G C \ {zi,..., Zn} ■ 


Taking the limit t ^ Zr on both sides of the equality, we hnd 


UU^Zr-Z,) 


= Or 


Corollary 2.2. Let zi,..., G C be pairwise different complex numbers. Then 


= 0 for all k & {Q,... ,n — 2} . 


_^ 


For example, we hnd for n = 4 and k = 2\ 


+ 


+ 


+ 


d? 


[a—h){a—c){a—d) {b—a){b—c){b—d) {c—a){c—b){c—d) {d—a){d—b){d—c) 

Proof. Let k E {0,..., n — 2}. Using equality ([2]), we obtain 


= 0 . 


S (^ - Zi) YY^zliz.. 






I 


) uU(^-z,) 


for all t G C \ {zi,..., Zn-i} ■ 


( 3 ) 


( 4 ) 


( 5 ) 


Setting t := Zn and rearranging the equation yields the statement. 


To introduce the basic idea of our proof, we hrst recall the relationship between a vector z := 
{zi, Z 2 , ■ ■ ■, Zn) and the vector of the elementary symmetric polynomials evaluated at z, i.e. ( 61 ( 2 ;),... 
To this end, we dehne the characteristic polynomial of a linear map with the invariants ei,..., e^,: 

n 

K{t) := r-eif"-i + e 2 t "-2 + ... + (-ire„ = T + ^(- 1 )'= e^, 

fc=i 


Since if he has the roots zi,..., we can write 

he{t) = {t- Zi){t- Z2) ...{t- Zn) 

= — (zi + . . . + Zn)P' ^ + (zi Z 2 + • • ■ + Zn-l Zn)t"' ^ + . . . + ( —l)"'Zi . . . Z„ (6) 

= r - ei(z)r-i + e2(z)r-2 + ... + {-l)^en{z). 


,e„(z)). 
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In this paper we will study different restrictions in the co-domain of the elementary symmetric 
polynomials. However, we will always assume that this co-domain is positive and real. It is also 
convenient to introduce an ordering of the complex numbers in order to ensure the uniqueness of 
the coefficient vector corresponding to a given set of roots. We therefore dehne the set 

:= G C" I Re(zi) > .. . > Re( 2 :„) , Rez* = ReZj+i ImZi > ImZj+i Vf G {1,... ,n —1}} , 

which contains only ordered vectors and thereby excludes all rearrangements. Furthermore, we 
define the set 

M := { 2 ; G C"'*'I ei(2;),e2(z),... , 6 ^( 2 ;) G M+} 

of all ordered vectors with exclusively positive elementary symmetric polynomials. In contrast to 
previous work on the SSLI, we extend our view directly to complex roots in M, which provides 
the crucial advantage. 

Lemma 2.3. The function M —)■ that maps each vector z & M onto the coefficient vector 

e corresponding to the uniquely determined polynomial he with roots zi,..., Zn is continuous and 
bijective. Its inverse function is continuous as well, and we denote it by 

(f: —)■ C , (ci,... Cn) e-)- ip{ei ,..., e^) . 

Furthermore, each vector [z\,..., Zn) & M contains only positive real numbers and complex conju¬ 
gate pairs of numbers. 

Proof. The elementary symmetric polynomials ei{z),..., en{z) evaluated at 2 ; are exactly the co¬ 
efficients ei,...,e„ of the polynomial hg with the roots zi,...,Zn. The elementary symmetric 
polynomials are obviously continuous. 

On the other hand, applying the fundamental theorem of algebra, we know that hg has exactly n 
complex roots, all of which are either real or complex conjugate pairs. It is easy to see that all 
real roots must be positive: since the polynomial hg{—t) = F -\- t"'~^ > 0 for all x G M+, 

because all are positive. Thus hg{—t) has no positive and therefore hg{t) has no negative real 
roots. A proof of the continuity of (p is shown in [^. ■ 

We now come to the proof of the SSLI. The main 
idea has already been pursued in prior attempts to 
prove the inequality: instead of directly working 
with the function f{z) := the 

set M of roots, we consider the composition f o ip 
which depends on the elements e G T of a suitable 
set of coefficients T C M”. Of course, we have to 
choose T in a way such that (/ o p){e) G M for all 
e G T. 

The proof of the SSLI now can be divided into 
two steps: 

d(fop) 

1.) We show that —--> 0. 

OCk 

5 



Figure 1: The graph of a function g 
with non-negative partial derivatives but 
non-convex domain of definition; note that 
g{x) > g{y), although Xi < yi for i G {1, 2}. 






2.) We find a path 7: [0,1]—)- <fiT) with 7(0) = 

X, 7(1) = y such that -;-efc(7(s)) > 0 for all 

ds 

s G (0,1) and /c G {1 ,..., n — 1} as well as 
^671(7(5)) = 0 for all s G (0,1). 


Note carefully that condition 1.) alone is not sufficient. To understand this, let us consider the 
graph of a function g: D ^ with a non-convex domain D C as shown in Figure [TJ 

As we see, even though the function only has non-negative partial derivatives, it is not true that 
g{x) < g{y) for each pair x,y ^ D such that (componentwise) x < y. In particular, we cannot find 
a path connecting x to y which is increasing in all components. 


We therefore want to choose an appropriate domain T C in order to prevent these complica¬ 
tions. The problem is that for a too restricted choice of T, we do not easily find suitable paths 
satisfying condition 2.), while if T is chosen too large, it becomes more difficult to prove that 
condition 1.) holds on all of T. 


In [2^, Pompe and Neff managed to prove the SSLI for n 


4 by choosing 


T = {(ei(z),...,e„(^)) UgR^^}; (7) 

the authors in fact do not to limit the inequality to the function f{z) := i but to 

prove it for a whole class of functions. For these /, they show condition 1.), i.e. the non-negativity 
of the partial derivatives with respect to the fc-th elementary symmetric polynomial, by showing 
for each point on the path from condition 2.) that 

n , n \ 

s := rife---) so- ( 0 ) 

i=i 7=1 ' 


Although their choice of the domain T is not convex, they demonstrate the existence of paths from 
X to y without constructing them explicitly: they show that a path which satisfies condition 2.) 
can be continued and must therefore reach y after starting at x. However, for n > 4, this method 
seems impractical due to the amount of computational effort required. 

In this paper, we consider the convex set 


T := R” . ( 9 ) 

By this choice, satisfying condition 2 .) is rather trivial; we take the straight line 7 in T from 
e(x) to e(y), i.e. 7(5) := s ■ e{x) -|- (1 — s) • e{y), and consider the curve 7 : [0,1] —)■ M with 
7(5) = 75(7(5)). Of course, it is difficult to explicitly characterize 7, but this is not necessary for 
our proof. It therefore only remains to verify condition 1 .), which requires some more elaborate 
methods. 

As indicated earlier, we define the function 

n 

/: (C\ (R_ U { 0 })'^ C with f{zi,...,Zn) ZiY , ( 10 ) 

i=l 

where log denotes the main branch of the complex logarithm, which is defined for C \ (—cxo, 0 ]. 


6 




Lemma 2.4. The composition f o ip is a real-valued function on M”, and takes the form 


f o p: Wf with (/ O p){e) := f{pi{e),.. .,(pn{e)) . 

Proof. From Lemma |23]we already know that the components of (<pi(e),..., ^n{e)) are either real 
or pairs of complex conjngates. For snch pairs, 

(log(pi(e))^ + (log(pi(e))^ = (log(pi(e))^ + (log(pi(e))^ = (log+ (log(pi(e ))2 eM. 

Since (log(pj(e))^ is real-valned for all real components Pi{e) as well, it follows that 

n 

(/ ° T){ei{z),.. .,eniz)) = ^(logZj)^ G M. ■ 

i=l 


In order to prove condition 1.), we need to compnte the inner derivative of the composition fop: 


Proposition 2.5. Let e := (ci,..., e^) G be such that all components of p{e) (i.e. the roots of 
he) are pairwise different. Then p is differentiable at e with partial derivative 


d‘Pi, . _ * 

deP> - WPAvde) - fi(e)) 


for any k E { 1 ,..., n} . 


Proof. Using the notation /i(ei,..., e„, t) := he(t), we can characterize the fnnctions pt, i G 
{1 ,..., n}, implicitly by the eqnation 

0 = h(ei,..., e„, pi{ei,..., e„)) for all e = (ci,..., e„) G M+ . 

According to the implicit function theorem, pi is differentiable at a point e G if ^(e, v?i(e)) 7 ^ 0. 
By assumption, the roots 9 ?i(e),..., Pn{,e-) of he are pairwise different; in particular, the root Pi{e) 
is simple and therefore ■§ih{e, ^i{e)) 7 ^ 0. We differentiate 


n—1 




( 11 ) 


dh 


dpi 


= (-l)Vi(e)“-‘ . 1 + -^(e.yy.ie)) ■ ^(e). 
and rearrangement yields the desired equality, since ^(e, (pj(e)) = Y\p=i{}Pi{^) ~ 




After this preliminary work, we can formulate condition 1.) in our context: 

Conjecture 2.6. Let e = (ci, ..., Cn) G M" . Then the composition f o p is differentiable and the 
partial derivatives ^^^(e) for A: G {1,..., n — 1} are positive, i.e. 


d{f o p) 

dck 


(e) > 0 


for all e G and any k E {I,... ,n — 1} . 
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We will prove Conjecture 12.61 under the additional assumption that all components of (p(e) are 
pairwise different (i.e. that hg has only simple roots). With this restriction, the assertion does 
not imply condition 1.), but we will compensate this drawback with some additional work in 
Proposition 12.91 as well as Lemmas 12.101 and 12.111 


Lemma 2.7. Let e = (ci,..., e„) G be such that all components of (p{e) are pairwise different. 
Then 


d{f o <p) 
dck 


{e) 


71 / / \ \ ^^ 1 

^ U^iiTjie) - Tiie)) 


\ogip{e). 


( 12 ) 




Proof. Because the components of ip{e) are pairwise different, ip is differentiable at e according 
to Proposition 12.51 Then f o p is differentiable at e as well, and we can determine the partial 
derivatives for k G {1,..., n} using the chain rule: 


d{f o p) 
dck 


(e) 




2=1 


E 


dck 


2\ogpi{e) {-lY"^^pi{eY ^ 


^ - pj{e)) 




n+k ,^ (^\n—k—l 


(_l)n+A: 


tr (-i)"-^n;=i(v^.(e)-^,(e)) 


log Pi{e) 


3^1. 


- 2 E 




n—k—1 


3#* 


logif>i(e) ■ 


(13) 


We now want to show that the partial derivatives oi f o p are positive. The expression calculated 
in Lemma 12.71 also appeared in other work. For example, Mitchison and Josza in 2004 point out 

that 


in the appendix of [19 


(-wE 


^ n 

i=i ■*-L= 




Zi) 


> 0 


(14) 


for all g G 2,..., n. If we let Zi = Pi{e), we seem to have already reached our goal. However, 
Mitchison und Josza prove inequality flTTl) only for the case that zi,..., Zn are the eigenvalues of a 
Gram matrix, which is necessarily symmetric and positive definite. Their ineqality can therefore 
only be applied to positive real numbers zi,..., Zn, while in our case, pi{e), ..., Pn{e) might also 
include pairs of complex conjugates. 

We close this gap by showing: 

Lemma 2.8. Let {zi,..., Zn) G M. Then for all r G {0 ,... ,n — 2}, 


-E 




it - ^.) 

3i=i 


^og Zi = 




df > 0. 
























Proof. We observe that 


n n— 1 n { ^ ^ 

]^(t + - 2 ;j) = p'^^ek{z)P'~’^ + '^Zj > min 11”, 2 ;^ I > 0 

j=l k=2 j=l ^ j=l ^ 


for all t G M+. Therefore, since r < n — 2, 


n-=i(^+^. 


dt < 


/o n;=i^. 


-df+ / p-^dt < 


<t- 


n;.r-. 


+ 1 . 


(15) 


Thus the integral converges for r G {0,..., n — 2} and, since j^n > 0, its value is positive. 

We now use ([2]) and exchange the order of summation and integration, where in evaluating the inte¬ 
gral at its “upper boundary” we use that log(t-l-Zj) = logt-l-log(l-l-y), whereas this decomposition 
is not applied at t = 0: 


-* 1 




(-Zi)’ 


ui.iit+^j) Jo ■jG'( i+=i)n”.i((-zi)-(zj)) 


dt 




[ — Zij 


■ log{t + Zi 

=logt+log(l+^) 


t^OO 


t=0 


h" (E rrw^ +E ryn^ 

n,=i(% - tr n,=i(^. - 


Zi) 


log 1 + 


It is easy to see that lim^^oo log (1-1-^)= 0. 
This immediately implies that 


(16) 


-E 


-Zi] 




logZi 




= 0’ because otherwise limt^oo ELi 

would diverge, in contradiction to the already established convergence of the integral. This com¬ 
pletes the proof. ■ 


Proof of Conjecture Id. 61 in the case of pairwise different roots ‘Pi{e). 

We can now conclude: for all n — fc — 1 G {0,..., n — 2}, that is for A; G {1,..., n — 1}, 


^(/ o p) / \ Prop l2.7l 


dck 


- 2 E 




n—k—1 


Prop [2.8l 2 


t - Pi{e)) 

^n—k—1 


log(p(e) 


/o + 


dt > 0. 


This shows that condition 1.) holds on nearly the entire domain T; the only problems occur for 
those points e G T for which has multiple roots. The set T is indeed convex, meaning that we 
can easily construct a path as described in condition 2.), but since such a path may pass through 
points with multiple roots, it is not necessarily differentiable everywhere. However, as the next 
proposition shows, under suitable assumptions a straight line in T contains at most hnitely many 
of these problematic points: 
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Proposition 2.9. Let po and pi be polynomials of degree n such that at least one of them has 
only simple roots. Then there are only finitely many s E [0,1] such that the polynomial Ps '■ = 
(1 — s) Po + spi has multiple roots. 


Proof. Let p = a nr=i(^ — Xi) = X]r=o ^ polynomial of degree n with roots zi,... ,Zn 

and coefficients ao,..., a„. The discriminant of p is defined as 


D{v) := n 

l<2<j'<n 


( 17 ) 


and is zero if and only if p has mnltiple roots (see also 17|, p. 204]). The discriminant is a 
symmetric homogeneons polynomial in the variables zi,...,Zn and thns can be expressed as a 
homogeneons polynomial in efc(zi,..., i.e. in the coefficients a^li] 


The coefficients of ps are polynomials in s, so D[s) ;= D{ps) is also a polynomial in s. If D{s) is 
the zero polynomial, then both po and pi mnst have mnltiple roots. On the other hand, if both 
Po and Pi do not have mnltiple roots, then D{s) is not the zero polynomial and is zero only for 
hnitely many s E [0,1], and thns Ps can have mnltiple roots only for hnitely many s G [0,1]. ■ 


For vectors x,y E M!]:, we can therefore directly prove the SSLI nsing conditions (1) and (2) 
only nnder the additional assnmption that at least one of the two vectors has pairwise different 
components. In order to circnmvent this limitation, we hrst show a strict version of the SSLI; 

Lemma 2.10. Let x = (xi,..., x„), y = (pi,..., yn) E M” be such that xi > X 2 > ... > Xn, 
yi>y 2 >--->yn, ek{x) < ek{y) for all k E {l,... ,n - 1} and e„(x) = en{y). Then f{x) < f{y) 
with f as in OB, !.e. 


fin = = /(»)■ 

i=l i=l 


Proof. Consider the path = (e^,..., e^) C for s G [0,1] with 

el := (1 - s) efc(x) + sefc(|/). 

Then e° = e(x) and = e{y) as well as efc(x) < ek{y) for all A: G {1,..., n — 1} and e„(x) = en{y). 
Since Xi > X 2 > ■ ■ ■ > Xn, the polynomial hgo has pairwise different roots. According to Proposition 
12.91 there is only a finite nnmber of s G [0,1] snch that /ig® has pairwise different roots. Then by 
Conjectnre 12.61 / o p is differentiable at all bnt hnitely many points along the path s e^, and 


k=l 


P) 


dei 


ds 


E 

k=l 


d{f o ip) 


• (efc(2/) - efc(x)) > 0 



(18) 


Thns the continnity of the mapping s t— )■ (/ o (p)(e<j) implies its strict monotonicity on [0,1], and 
therefore 

/(x) = (/ o (p)(e(x)) = f{ip{eo)) < f{piei)) = (/ o p){e{y)) = f{y) . ■ 

^For example, the polynomial q{t) = P—aP+bt—c has the discriminant D{q) = P 6^—4 6^—4 c+18 a 6c—27 P. 
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In the next step, we use the continuity of the elementary symmetric polynomials in order to show 
the strict inequality for those x,y whose components are not pairwise different. 

Lemma 2.11. Let x = y = {yi,... ,yn) G M" be such that ek{x) < Ckiy) for all 

k E {I,... ,n — 1} and en(x) = en(y). Then f{x) < f{y) with f as in ffTOj) . i.e. 

n n 

fix) = 5^(logXi)2 < ^(logyi)^ = f{y). 

1=1 i=l 

Proof. If the components oi x = {xi,..., Xn) are pairwise different, then we can assume Xi> X 2 > 
... > Xn and yi > y 2 P ■ ■ ■ P yn after rearrangement. In this case. Lemma [2.101 shows that the 
inequality holds. 

Otherwise, we need to slightly change the identical component pairs of x in order to apply Lemma 
12.101 Because of the continuity of the elementary symmetric polynomials, if the changes to the 
components of x are sufficiently small, then the resulting vector x' still satisfies the inequality 
ek{x') < Ckiy) for all k G {!,...,n — 1}. In order to preserve the equality e„(a;') = en{y), we 
choose x' := Xi (1 + e) and x'- := xj for each component pair Xi = xj and small e > 0. So we 
hnd 

(logx-)^ + (logx' )^ = (logXi + log(l + e))^ + (logXj - log(l + e))^ 

= 2(logXi)^ + 2(log(l + e))^ > (logXi)^ + (logXj)^ . (19) 

If we dissolve all pairwise equalities in this way, then we can apply Lemma 12.101 to find 

"■ "■ 19 1 ni 

^(logXi)^ < ^(logx')2 < ^(logyi)^ ■ 

i=l 2 = 1 2 = 1 

Using the continuity of the elementary symmetric polynomials and the logarithmic function we 
are finally able to extend Lemma [2 .11 1 and thus prove the SSLI without any restrictions: 

Theorem 1.2 (Sum of squared logarithms inequality). Let n eN and Xi,X 2 , ■ ■ ■, x„, yi, 2 / 2 ) ■ ■ ■, ?/n ^ 
M_i_ such that 

efc(x) < Ckiy) for all k E {1,... ,n - 1} 
and On(x) = Cniy ). 

n n 

Then < Y^{\ogyif . 

2=1 2=1 


Proof. Choose S C {l,...,n — 1} such that efc(x) = Ckiy) for all A: G S' and ek{x) < Ckiy) for all 
k E {I, ... ,n — 1} \ S. Furthermore, for A: G {1,..., n} and m eN, let 


e 


m 

k 


ek{x) - f if /c G S', 
efc(x) otherwise 


and X™ := (p(e™). 


Then 0 < ek{x^) < Ckiy) for all A; G {1,... ,n — 1} and all sufficiently large m G N as well as 
e„(x™') = en{y), thus Proposition 12.111 yields sufficiently 

large m eN. Since lim^-^-oo = x, we find 


5^(loga:i)^ 

2=1 


lim ^(logx™)^ < 

m—>-co ' ^ 

2 = 1 


2=1 
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3 Applications and Connections 


3.1 Relation to entropy 


Jozsa and Mitchison [l^ study entropy and “subentropy” from the perspective of quantum infor¬ 
mation theory. They introduce and investigate partial derivatives of these quantities with respect 
to Cfc, and establish their unrestricted nonnegativity. They view entropy as a function of the sym¬ 
metric polynomials and use analytic continuation to extend the dehnition to the entire set of 
nonnegative real e^. Consequently, they obtain integral representations of entropy (and suben¬ 
tropy) from which the desired nonnegativity properties follow. More interestingly, they study 
higher partial derivatives w.r.t. the elementary symmetric polynomials and establish complete 
monotonicity of entropy and subentropy. Similar higher order monotonicity properties can be 
established for f{x) = 


In [l^ Dannan, Neff und Thiel discussed applications of the SSLI towards the entropy of probability 
distributions. We now prove a statement very similar to the entropy expression of two vectors: we 
repeat our procedure from the last section with 


(C\M<o)’"-)■ C, g{zi,... ,Zn) =-'^Zi\og{-Zi) (20) 

i=l 


instead of /, but otherwise identical notation and dehnitions. The composition g o (p on is 
once more a real-valued function: x G M_ and y G M^o imply x log(—x) G M, while for complex 
conjugate pairs we hnd 


Xi log Xi + Xi log Xi = Xi log Xi + Xi- log Xi = Xi log Xi + Xi log Xi G M . 


Analogously to the last section, the function g o p can be expressed as 

n 

{g op): Rl {f o p){e) = f {pi{e),..., pn{e)) = -^(pi(e) log pi{e), 

i=l 


and for Xi,..., G M+ we have {g o p)[ei{x), e 2 (x),..., e„(x)) = - X)r=i ^ogXi. 
We now determine the partial derivatives: 




= 5^^(<^i(e),...,V?n(e)) ^(ei,...,e„) 

2=1 ^ 
n 

^(log(pi(e) + 1) 


dck ^ dzi 

2=1 

PropEH " 


2=1 

n 


Y. 


dek 

(_l)fc+i p^{e)^-’^ 

nTWwWT)) 


iiti 

¥^i(e)) 


(log pi{e) + l) 


( 21 ) 


, n—k 




log pi{e) - 


{-‘fiie)) 


n—k 




- f.w) 
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By (jlj), the second sum is zero for n — A; G {0,..., n — 2}, i.e. for A: G {2,, n}. Thus we obtain 


d{g o 
dck 


[e = - 




(-V7i(e)) 


n—k 


1 nj=i(‘^i(e) - ^i{e)) 


. . Lemmi2.81 
log^iie) > 0 


for n — A: G {0,..., n — 2} or, equivalently, for A: G {2,..., n}. 


Remark 3.1. It should also be noted that = 2 

OCk ^^k+1 


As with the SSLI, we can now infer the monotonicity of go tp along straight lines, which yields the 
following result. 

Corollary 3.2. Let n G M and xi,X 2 , ■ ■ ■, Xn, yi, 1 / 2 , • • •, 2 /n > 0 such that 


and 


Then 


ei{x) = ei{y) 

ek{x) < ek{y) for all k e {2,... ,n} . 

n n 

-^XiXogXi < - ^ yi log yi. 

i=l i=l 


Weakening the equality in the hrst condition to ek{x) < efc( 2 /) for all A: G n} again allows for 

obvious counterexamples similar to the SSLI; For x = (e“^,... e~^) G and y = (1,..., 1) G 
the weakened condition is true, yet — log 2 ;* = ^ > 0 = — Vi log?/i- 

See also the discussion in Section |3l 


3.2 The SSLI in terms of matrix invariants 

Let U G Sym+(n), where Sym_|_(n) C denotes the set of positive dehnite symmetric n x n- 

matrices. Then U is orthogonally diagonalizable with real eigenvalues Ai,...,A„ > 0. The k- 
th invariant Ik{U) of U is dehned as the A:-th elementary symmetric polynomial of the vector 
X{U) = (Ai,...,A„), i.e. Ik{U) := efc(A(17)); (thus, h{U) = tiX and 4(C) = detC). 

The SSLI can be equivalently expressed in terms of these invariants of positive dehnite symmetric 
matrices. 

Theorem 3.3. Let 4, 4 G Sym_|_(n). If Ik{U) < Ik{U) for all k E {1,..., n—1} and det 4 = det U, 
then II log 4|p < || log 4|p, where log is the principal matrix logarithm on Sym_|_(n) and || . || denotes 
the Frobenius matrix norm. 

Proof. Since || log4||^ = X]r=i(log ^*(1^))^) using the SSLI we immediately have 

n n 

||log) 7 ||" = J](logAi(f/))" < J](logAi(f/))" = ||logC/f . ■ 

i=l i=l 
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Theorem 13.31 can be applied directly to the quadratic Hencky energy 


l 2 I ^ u n „ 7-7-M2 111 „rrll 2 ^ 


Wu{F) = /x|| dev„log[/f + ^ [tr(logl7)]^ = /x || log t/f + ^ [log(det t/)]^ 


which was introduced into the theory of nonlinear elasticity in 1929 by H. Hencky [13[. Here, 
F G GL’''(n) is the deformation gradient, GL’'"(n) is the set of invertible n x n-matrices with 
positive determinant, U = Vf^F is the right stretch tensor and dev„ log U = log U— ^ tr(log U) ■ 1 
is the deviatoric part of the Hencky strain tensor log U. The material parameters fi, X with /i > 0 
and 3 A + 2 yu > 0 are called the Lame constants, while k > 0 is known as the bulk modulus. The 


Hencky energy has recently been characterized by a unique geometric property [27|, [24, l28[: it 


measures the squared geodesic distance of F to the special orthogonal group SO(n) with respect 
to a left-GL(n)-invariant, right-SO(?7,)-invariant Riemannian metric on GL(n). 

In terms of the quadratic Hencky energy. Theorem 13.31 can be stated as follows: 


Corollary 3.4. Let F,F e GL+(n) with U = VWF and G = Vf^F. If det U = detU and 
h{U) <h{U) for alike n-1}, then Wh{F) < VLff(F). 


Remark 3.5. Corollary 13.41 means that Wh satishes a version of Truesdell’s empirical inequalities 


32, pages 158, 171]. 


In a similar way. Corollary 13.21 can be stated in terms of matrix invariants. 

Theorem 3.6. Let U,U E ^m_|_(n). IftiU = trU and Ik{U) < Ik{U) for all k E {2,...,n}, 
then {U,\ogU — 1 ) > {U ,\ogU — I), where {X,Y) = denotes the canonical inner product 

of two n X n-matrices X and Y . 


Proof. Corollary 13.21 implies ATf/1 logAHHl > X]r=i Using the condi¬ 

tion trU = tr U, we compute 

n 

{U, log [/-!)= {U, log U) - {U, t) = J2 ■ log 

i=l 

n 

> Y,HU) ■ log X^{lJ) -tiU = {U, logU- 1). ■ 

i=l 


Remark 3.7. The constitutive law induced by the hyperelastic energy potential 

Wb{F) = {U,\ogU-l) = {VF^F,\ogVF^F -1) 


is a special case of Becker’s law of elasticity, which was introduced by the geologist G.F. Becker in 
1893 in a way remarkably similar to H. Hencky’s deduction of the quadratic Hencky energy 25 


Becker’s elastic law is hyperelastic (i.e. admits an energy potential) only in the lateral contraction 
free case, which is described by the energy function Wb- 


Since tr(XlogX) = Aj(77) logAj(X), we can translate the last Theorem 13.61 into a statement 

for the quantum von Neumann entropy. 

Corollary 3.8. Let X,Y E Sym+(?T,) be density matrices, so that tr X = trH = 1. If Ik{X) < 
Ik{Y) for all k E {2 ,... ,n}, then the von Neumann entropy — tr(XlogX) < — tr(y logH). 
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3.3 Application to geodesic distance on Sym^_(n) 


The convex cone Svni4- (n) is freqnently also viewed as a Riemannian manifold endowed with the 
Riemannian metric (^, EH 0, l2ll . 30 


gciX,Y) = tTiC-^XC-^Y), 


( 22 ) 


where C G Sym+(n) and X,Y E Sym(n) = Tc Sym+(n), the tangent space at C. This manifold is 
geodesically complete, and the nniqne geodesic joining Ci,C 2 E Sym+(n) has the closed form j^, 
6 . 1 . 6 ] 

7(t) = fe[0,l]. (23) 

The geodesic distance between Ci,C2 E Sym_|_(n) is given by B 6,1.6] 

distg„d,s,„+(„)(Ci, C 2 ) = II Cl C,-’''’)!!. 

In particnlar, for 6*2 = 1, we obtain the simple formnla 

distgeod,Sym+(n)(C'li l) = II log <^1 f • 

The snm-of-squared-logarithm ineqnalities can therefore be stated in terms of the geodesic distance 
of positive definite symmetric matrices to the identity matrix 1. 

Corollary 3.9. Let hiC ),..., In{,C), h{C ),..., In{C) denote the principal invariants of C,C E 
Sym+(n). If Ik{C) < Ik{C) for all k E {1,... ,n — 1} and In{C) = det C = det C = In{C), then 

distgeod, Sym+(n) (dl, 1) ^ distggod, Sym+(n) (C*? l) • 


Since 1 commntes with every matrix, distgeod, Sym+(n) actnally reduces to the log-Euclidean distance 

distiog_Euciid(C', C) := II logC - logCIl , 

i.e., the Euclidean distance between the principal logarithms of two matrices j^. Thus, the SSLl 
may be equivalently stated for distiog.EucUd- 


3.4 Additional applications 

The SSLl may also find applications in other fields. In the following we list some of these potential 
applications. 

• Recall that Ik{X) = tr A^X, where A denotes the antisymmetric tensor product (Grassmann 
product). Thus, the SSLl for matrices can be equivalently stated in terms of this tensor 
product. This notation immediately suggests an interesting generalization, namely to the 
dual case of symmetric tensor product denoted trV^X = hk{X{X)), where hk denotes the 
k-th complete symmetric polynomial. Thus, we can consider inequalities of the form 

tr < tr for all A: G {1,..., n — 1} and tr V"'X = tr V"'F 

^ F{X) < F{Y) . 

More generally, extensions of the SSLl to other matrix monotone functions may be obtained 
by building on j^ . 
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It might also be of interest to characterize the matrix functions —)■ for which 


i£ II sym^(g^Z)|| = inf ||^(Q^Z)|| = ||^(i?^Z)|| = || J-(//)|| 

QeSO(n) QGSO(n) 


where Z = RH is the polar decomposition of Z, see 18|, l26|] and compare to flTl) . 

The hnal connection that we mention is perhaps the most interesting. Nonnegative elemen¬ 
tary symmetric polynomials of matrices have been studied under the guise of Q-matrices 
(Qo-matrices) [IJ]- These are complex matrices, whose elementary symmetric polynomials 
are positive (nonnegative). These matrices are much more tractable than the better known 
class of P-matrices (i.e. matrices with positive principal minors), which have been exten¬ 
sively studied in matrix analysis and optimization 12, 3, l^- The following theorem of Kellog 


applies to P matrices, and also to Q matrices [14 


Theorem 3.10 ([16|). Let X E be a matrix for which Ik{X) > 0 for k E {1,... ,n}. 

Then, the spectrum of A is contained in the set 


D := |z : I arg^l < tt-1 . 

Moreover, if any eigenvalue of X lies on the boundary of D, then necessarily all symmetric 
functions Ik{X), except In{X), are equal to zero. 


4 Alternative proof of Conjecture 12.6 


In Section[2l we have only shown Coniecture 12.61 under 
the additional assumption that e E R” corresponds 
to pairwise different roots (p(e) which was sufficient 
to prove the SSLI. Nevertheless, it is striking that the 
expression of the partial derivatives 


d(f o t) 

dek 


(e) = 2 





(24) 


can be extended continuously to e G R” with mul¬ 
tiple roots. This strongly suggests that Conjecture 
12.61 indeed holds for all e G R”. We now restate the 
conjecture as a lemma and give a short proof for the 
general case. 



Figure 2: The path C consists of the fol¬ 
lowing four pieces C^, 6*+, Cr and C-. 


Lemma 4.1. For a = (oi,..., a„_i) G R" ^ let ha{z) = z'^ + a„_i z'^ + ai z + 1. Define the 

function f: R"”^ by 

/(ai,...,a„_i) = (log(-^))^ 

_ z£C 
ha{z) = 0 


where log denotes the principal branch of the complex logarithm on C\ R<o and roots are counted 
with multiplicities. Then all partial derivatives ■§^{a) for any a E R"”^ are positive. 
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Note carefully that the roots Zi,..., of ha in the original formulation of Conjectnre l2.6l correspond 
to the negative —zi,, —Zn of the roots of ha in Lemma 14.11 

The function / is well defined, since none of the roots of ha are positive real numbers. Furthermore, 
/ is real-valued, because the roots F of ha only occur as real numbers or complex conjugate pairs, 
which means that the values of (log(—F)) are also real or complex conjugate pairs. 

Proof. We write the function / as a contour integral. In order to accommodate the multiple values 
of log(—;2) we consider the Riemann surface with boundary obtained by gluing two copies of R+ 
to C \ M>o, where R>q :={a;GM|x>0}. We will view the two copies as north and south copies 
M+ depending on whether they are approached from above and from below respectively. Note that 
the log(—z) function extends to the following 


log(-rnorth) = logr - TTZ, log(-rsouth) = logr 7li 
on the north and south copies of M+. 

In a neighborhood of fixed a G for any sufficiently large R and small enough e > 0, we 

can apply the generalized argument principle i to find^ 

/(«) = ^ / 

Jc ha{z) 

where the path C consists of the following four pieces Ce, 6*+, Cr and C-, see also Figure E) 

• Ce is the circle of radius £ around 0 travelled clockwise from £south to £north; 

• (7+ is the line segment from £north to Rnorth on the north copy of the positive real axis; 

• Cr is the circle of radius R around 0 travelled counter-clockwise from Rnorth to i?south; 

• is the line segment from i?south to ^south on the south copy of the positive axis. 

By straightforward computation, we hnd 

d K{z) ^ £^K{z)-ha{z) -K{z) • £^K{z) 
dak ha{z) (ha{z)Y 

k z^ ■ ha{z) — z^ ■ h'^{z) f z^ 

(ha{z)Y Vha(2:) 


The integrand is analytic on the compact path C, hence we can differentiate inside the integral: 



related representation to (l25ll has been used recently in [l^ in connection with the entropy function. 
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We can now take the limits as i? —)■ +oo and e —)■ 0. Since k <n — l^ the integral over Cr tends to 
zero (the length of Cr is 2ttR, while the fnnction is of order 0{R~^\ogR)). The integral over Ce 
also tends to zero, since for k>l, the integrand is of order 0(log£) and the length of is 2718. 

We conclnde: 


df{a) 

dak 


1 

ni 


lim 

£\0,-R—>-oo 




'C+UC_ ha[z) 



1 

ni 


lim 

£\0,-R—>-oo 



\ 

— Tii) — (log t + Tii)) — - dt I 

ha{t) J 


pR 1 

2 lim / - dt 

e\0,R-^ooJ^ h^{t) 


2 



^k—l 

— - dt > 0 . 

ha{t) 


(28) 
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